Crossing the streams: a framework for streaming analysis of short DNA sequencing reads

نویسندگان

  • Qingpeng Zhang
  • Sherine Awad
  • C. Titus Brown
چکیده

5 We present a semi-streaming algorithm for k-mer spectral analysis of 6 DNA sequencing reads, together with a derivative approach that is fully 7 streaming. The approach can also be applied to genomic, transcriptomic, 8 and metagenomic data sets. We develop two tools for short-read analysis 9 based on these approaches, a method for semi-streaming k-mer-based error 10 trimming, and a method for the analysis of error profiles in short reads 11 using a streaming sublinear approach. These tools are implemented in the 12 khmer software package, which is freely available under the BSD License 13 at github.com/ged-lab/khmer/. 14

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Streaming algorithms for identification of pathogens and antibiotic resistance potential from real-time MinION1/cmr/m/n/10 TM sequencing

The recently introduced Oxford Nanopore MinION platform generates DNA sequence data in real-time. This opens immense potential to shorten the sample-to-results time and is likely to lead to enormous benefits in rapid diagnosis of bacterial infection and identification of drug resistance. However, there are very few tools available for streaming analysis of real-time sequencing data. Here, we pr...

متن کامل

Streaming algorithms for identification ofpathogens and antibiotic resistance potentialfrom real-time MinION sequencing

The recently introduced Oxford Nanopore MinION platform generates DNA sequence data in real-time. This opens immense potential to shorten the sample-to-results time and is likely to lead to enormous benefits in rapid diagnosis of bacterial infection and identification of drug resistance. However, there are very few tools available for streaming analysis of real-time sequencing data. Here, we pr...

متن کامل

Streaming algorithms for identification of pathogens and antibiotic resistance potential from real-time MinIONTM sequencing

The recently introduced Oxford Nanopore MinION platform generates DNA sequence data in real-time. This has great potential to shorten the sample-to-results time and is likely to have benefits such as rapid diagnosis of bacterial infection and identification of drug resistance. However, there are few tools available for streaming analysis of real-time sequencing data. Here, we present a framewor...

متن کامل

Transcriptome analysis of the freshwater pearl mussel, Hyriopsis cumingii (Lea) using illumina paired-end sequencing to identify genes and markers

The transcriptome of triangle sail mussel Hyriopsis cumingii (Lea) using Illumina paired-end sequencing technology was conducted and analyzed. Equal quantities of total RNA isolated from six tissues, including gonad, hepatopancreas, foot, mantel, gill and adductor muscle, were pooled to construct a cDNA library. A total of 58.09 million clean reads with 98.48 % Q20 bases were generated. Cluster...

متن کامل

Priority Setting Meets Multiple Streams: A Match to Be Further Examined?; Comment on “Introducing New Priority Setting and Resource Allocation Processes in a Canadian Healthcare Organization: A Case Study Analysis Informed by Multiple Streams Theory”

With demand for health services continuing to grow as populations age and new technologies emerge to meet health needs, healthcare policy-makers are under constant pressure to set priorities, ie, to make choices about the health services that can and cannot be funded within available resources. In a recent paper, Smith et al apply an influential policy studies framework – Kingdon’s multiple str...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • PeerJ PrePrints

دوره 3  شماره 

صفحات  -

تاریخ انتشار 2015